1 Introduction

Tao [Tao, 2007a] has recently proved the following theorem:

Theorem 1.1 (Main Theorem). Let $l \geq 1$ be an integer. Assume $T_1, \ldots, T_l : X \to X$ are commuting, invertible, measure-preserving transformations of a measure space $(X, \mathcal{B}, \mu)$. Then for any $f_1, \ldots, f_l \in L^\infty(X, \mathcal{B}, \mu)$, the averages

$$A_N(f_1, \ldots, f_l) := \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{l} T_i^n f_i$$

converge in $L^2(X, \mathcal{B}, \mu)$.

The case $l = 1$ is the mean ergodic theorem, and the result can be viewed as a
generalization of that theorem. The $l = 2$ case was proven by Conze and Lesigne
[Conze and Lesigne, 1984], and various special cases for higher $l$ have been shown by
Zhang [Zhang, 1996], Frantzikinakis and Kra [Frantzikinakis and Kra, 2005], Lesigne
[Lesigne, 1993], Host and Kra [Host and Kra, 2005], and Ziegler [Ziegler, 2007].



Tao's argument is unusual, in that he uses the Furstenberg correspondence princi- 
ple, which is traditionally used to obtain combinatorial results via ergodic proofs, in 
reverse: he takes the ergodic system and produces a sequence of finite structures. He 
then proves a related result for these finitary systems and shows that a counterexample 
in the ergodic setting would give rise to a counterexample in the finite setting. 

This paper began as an attempt to translate Tao's argument into a purely infinite 
one. The primary obstacle to this, as Tao points out ([Tao, 2007b]), is that the finitary
setting provides a product structure which isn't present in the infinitary setting. In 
order to reproduce it, we have to go by an indirect route, passing through the finitary 
setting to produce a more structured dynamical system. The structure needed, however, 
is not the full measure theoretic product. What is needed in the finitary setting is a 
certain disentanglement of the transformations, which amounts to requiring that the 
underlying set of points be a product of $l$ sets, with the i-th transformation acting only on
the i-th coordinate, together with a "nice" projection under a certain canonical factor.
We obtain this in the infinitary setting using an argument from nonstandard analysis. 

A measure space with this property gives rise to measure spaces on each coordinate, 
but need not be the product of these spaces: it could contain additional measurable sets 
which cannot be approximated coordinatewise. These additional sets turn out to be
key to the proof, since they are in some sense "uniform": they behave, relative to 
the commuting transformations, as if they were random. Perhaps unsurprisingly, the 
behavior of such sets has turned out to be central to a proof of an infinitary analogue of 
the hypergraph regularity lemma by Elek and Szegedy [Elek and Szegedy, 2007].

Using another nonstandard argument to pass from discrete averages to integrals, 
we show that the non-random functions can be approximated by certain functions of 
lower complexity in a certain sense. Proceeding by induction from low complexity 
to high complexity, we will be able to prove the theorem, using arguments which are 
essentially those given in [Tao, 2007a], translated to an infinitary setting. This second
nonstandard argument has a Furstenberg-style proof as well; a similar argument, along
with other variations, is given in [Towsner, 2008].

We thank Jeremy Avigad for providing many helpful suggestions. We also thank 
Terence Tao for answering a number of questions about his proof, and Tim Austin and
John Griesmer for finding significant errors in earlier versions of this proof. 

2 Extensions of Product Spaces 

We wish to reduce convergence of the expression in Theorem 1.1 in arbitrary spaces to
convergence in spaces where the transformations have been, in some sense, disentangled.
The useful location turns out to be extensions of product spaces; that is, given
an ergodic dynamical system $\mathbf{Y} = (Y, \mathcal{C}, \nu, U_1, \ldots, U_l)$, we would like to find a system
$\mathbf{X} = (\prod_{i \leq l} X_i, \mathcal{B}, \mu, T_1, \ldots, T_l)$ where each $T_i$ acts only on the i-th coordinate, but
which preserves enough properties of the original system that proving convergence for
all $L^\infty(\mathbf{X})$ functions is sufficient to give convergence for all $L^\infty(\mathbf{Y})$ functions.

$\mathbf{X}$ naturally gives rise to a product space, taking $\mathcal{B}_i$ to be the restriction of $\mathcal{B}$ to those
sets depending only on the i-th coordinate, but we do not require that $\mathcal{B}$ be the product
of the $\mathcal{B}_i$; in general, $\mathcal{B}$ may properly extend the product.

Given any such system, there is a maximal factor $\mathbf{X}' = (X', \mathcal{B}', \mu \upharpoonright \mathcal{B}')$ in which all
sets are $T_i T_j^{-1}$-invariant for each $i, j \leq l$. We must either accept poor pointwise behavior,
since, for example, this factor does not separate $x$ from $T_i T_j^{-1} x$, or, as we will do here,
take $X'$ to be a different set. Formally, we will want the property that if $\gamma$ is the projection
of $\prod X_i$ onto $X'$, then for every $i \leq l$ and almost every $x_1, \ldots, x_{i-1}, x_{i+1}, \ldots, x_l$,
the function $x_i \mapsto \gamma(x_1, \ldots, x_l)$ is an isomorphism from $(X_i, \mathcal{B}_i, \mu \upharpoonright \mathcal{B}_i)$ to $\mathbf{X}'$. This
obviously requires that all the $\mathbf{X}_i$ be pairwise isomorphic themselves (and further, that
$\mathcal{B}$ be symmetric under any change of coordinates).

This requirement is derived from the behavior exhibited by Tao's finitary setting.
Here the product space is the finite measure space on $\mathbb{Z}_N^l$ and $\mathbf{X}'$ is the finite measure
space on $\mathbb{Z}_N$. The map $\gamma : \mathbb{Z}_N^l \to \mathbb{Z}_N$ is just the map $(x_1, \ldots, x_l) \mapsto \sum_i x_i$, which has
the property that if we fix $x_i$ for $i \neq k$, the map $x_k \mapsto \sum_{i \neq k} x_i + x_k$ is an isomorphism.

Since $(\prod X_i, \mathcal{B}, \mu, T_1, \ldots, T_l)$ is not a true product space, we cannot rely on Fubini's
Theorem. Since we nonetheless wish to integrate over coordinates, we have
to rely on the use of certain invariant subsets to produce an analogous property. If
$e \subseteq [1, l]$, we will write $x_e$ for an element of $\prod_{i \in e} X_i$; we also write $\bar{e}$ for the complement
of $e$, and, when $e$ is the singleton $\{i\}$, write $\bar{i}$ for the complement of $\{i\}$. Given
some $x_e$, if $i \in e$ then $x_i$ denotes the corresponding element of the sequence $x_e$. Given
two such variables, say $x_e$ and $x_{\bar e}$, we will write $(x_e, x_{\bar e})$ for the combination of these two vectors,
that is

$$(x_e, x_{\bar e})_i := \begin{cases} (x_e)_i & \text{if } i \in e \\ (x_{\bar e})_i & \text{otherwise} \end{cases}$$

Note that this is not simply concatenation. For instance, if $f$ is a function on $\prod X_i$, we
will often write $f(x_{\bar k}, z)$ as an abbreviation for $f(x_1, \ldots, x_{k-1}, z, x_{k+1}, \ldots, x_l)$.
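The coordinate-merging convention can be illustrated with a small sketch. The helper names `combine` and `substitute` are hypothetical and chosen only for this illustration; partial points are modelled as dictionaries from coordinate index to value, which is an assumption of the sketch rather than anything in the text.

```python
def combine(x_e, x_rest):
    """Merge two partial points given as {coordinate index: value} dicts.

    The result is ordered by coordinate index, so this is an index-wise
    merge and not a concatenation of the two vectors.
    """
    overlap = x_e.keys() & x_rest.keys()
    if overlap:
        raise ValueError(f"coordinates given twice: {sorted(overlap)}")
    merged = {**x_e, **x_rest}
    return tuple(merged[i] for i in sorted(merged))


def substitute(x, k, z):
    """Model of f(x_{bar k}, z): replace the k-th coordinate (1-based) of x by z."""
    return x[:k - 1] + (z,) + x[k:]


x_e = {1: "a", 3: "c"}      # coordinates in e = {1, 3}
x_rest = {2: "b", 4: "d"}   # coordinates in the complement of e
print(combine(x_e, x_rest))                  # ('a', 'b', 'c', 'd')
print(substitute(("a", "b", "c"), 2, "z"))   # ('a', 'z', 'c')
```

Concatenating the two partial vectors would instead give (a, c, b, d), which is why the text stresses the distinction.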

Definition 2.1. Given a measure space $(\prod_{i \leq l} X_i, \mathcal{B}, \mu)$, for $k \leq l$, let $\mathcal{B}_{\bar k}$ be the
restriction of $\mathcal{B}$ to those sets of the form $\prod_{i \neq k} B_i \times X_k$ where $B_i \subseteq X_i$ (or having
symmetric difference of measure 0 with such a set).

With respect to $\mathcal{B}_{\bar k}$, we may identify elements of $\prod_{i \leq l} X_i$ with elements of $\prod_{i \neq k} X_i$
by discarding the k-th coordinate.

Definition 2.2. Let $\mathbf{Z}$ be a dynamical system. We say a measure disintegration exists for
some factor $\pi : \mathbf{Z} \to \mathbf{Z}'$ if there is a map $z' \mapsto \mu_{z'}$ from $Z'$ to the space of measures
on $Z$ so that $\mu_{z'}$ is supported on $\pi^{-1}(z')$, the map commutes with the group action (so
$\mu_{T_g z'} = T_g \mu_{z'}$ for each $g$ and almost every $z'$), and for any $f \in L^2(\mathbf{Z})$,

$$\int f \, d\mu = \iint f \, d\mu_{z'} \, d\mu'(z')$$

where in particular, the right side is defined.



This disintegration always exists given certain conditions on $\mathbf{Z}$ [Furstenberg, 1981],
but in our case it is easier to prove that one exists outright than to arrange for those
conditions to hold. We will be dealing with a dynamical system $\mathbf{X} = (\prod_{i \leq l} X_i, \mathcal{B}, \mu, T_1, \ldots, T_l)$
where each $T_i$ acts only on the i-th coordinate, and where the measure algebra is a (possibly
proper) extension of the product measure $\prod_{i \leq l} (\mathcal{B}_i, \mu_i)$. Furthermore, taking
$\mathcal{B}_e$ to consist of those sets depending only on coordinates in $e$, we will assume the
measure disintegration onto $(\prod_{i \neq k} X_i, \mathcal{B}_{\bar k}, \mu \upharpoonright \mathcal{B}_{\bar k})$ exists (we denote the corresponding
measures in the disintegration $\mu_{k, x_{\bar k}}$). We want to be able to exchange coordinates,
and further, to have an additional, $(l+1)$-st, coordinate which can "stand in" for any of
the other coordinates.

This extra coordinate will be a factor $\mathbf{X}' = (X', \mathcal{B}', \mu', T_1', \ldots, T_l')$ of $\mathbf{X}$ such that a
measure disintegration exists and the projection $\gamma : \prod_{i \leq l} X_i \to X'$ is $T_i T_j^{-1}$-invariant
almost everywhere (that is, for almost every $x$, $\gamma(x) = \gamma(T_i T_j^{-1} x)$ for each $i, j$). If
we fix all but one coordinate of $\mathbf{X}$, we obtain a function $\gamma_{x_{\bar k}} : X_k \to X'$ by setting

$$\gamma_{x_{\bar k}}(x_k) := \gamma(x_{\bar k}, x_k).$$

Definition 2.3. Let $\mathbf{X}$ have the form $(\prod_{i \leq l} X_i, \mathcal{B}, \mu, T_1, \ldots, T_l)$, and let $\mathbf{X}' = (X', \mathcal{B}', \mu', T_1', \ldots, T_l')$
be a factor of $\mathbf{X}$ with $\gamma$ the corresponding factor map. We say $\mathbf{X}'$ cleanly factors $\mathbf{X}$ if:

• $\gamma$ is $T_i T_j^{-1}$-invariant for each $i, j$

• For each $k \leq l$ and almost every $x_{\bar k} \in \prod_{i \neq k} X_i$, the function $\gamma_{x_{\bar k}}$ is an isomorphism
from $(X_k, \mathcal{B}, \mu_{k, x_{\bar k}})$ onto $\mathbf{X}'$

Recall that $\mu_{k, x_{\bar k}}$ is the measure given by the measure disintegration of $(\prod_{i \neq k} X_i, \mathcal{B}_{\bar k}, \mu \upharpoonright \mathcal{B}_{\bar k})$
evaluated at $x_{\bar k}$.

It is instructive to consider the finite case, where each $\mathbf{X}_i$ is just the discrete measure
space on $[1, N]$, as is $\mathbf{X}'$, and the function $\gamma(x)$ is simply $\sum_{i \leq l} x_i \bmod N$. The clean
factoring property here just asserts that if we fix all but one coordinate (and therefore
$y := \sum_{i \neq k} x_i \bmod N$), the map $\gamma_{x_{\bar k}}$, which is given by $\gamma_{x_{\bar k}}(x_k) = y + x_k \bmod N$,
is a one-to-one mapping.
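The finite case can be checked by brute force. The following is a minimal sketch under stated assumptions: coordinates are taken in $\{0, \ldots, N-1\}$ rather than $[1, N]$ (which only relabels points), each $T_i$ is modelled as adding 1 to the i-th coordinate mod $N$, and the values `N = 5`, `l = 3` and all function names are illustrative choices, not taken from the text.

```python
from itertools import product

N, l = 5, 3  # illustrative sizes only


def gamma(x):
    """The projection gamma(x) = sum of coordinates mod N."""
    return sum(x) % N


def T(i, x, power=1):
    """T_i^power: shift the i-th coordinate (0-based) by `power` mod N."""
    return x[:i] + ((x[i] + power) % N,) + x[i + 1:]


def fibre_map_is_bijection(k, x):
    """Fix every coordinate of x except the k-th and test z -> gamma(x with x_k = z)."""
    images = {gamma(x[:k] + (z,) + x[k + 1:]) for z in range(N)}
    return len(images) == N


points = list(product(range(N), repeat=l))

# Clean factoring: each one-coordinate map into Z_N is a bijection.
clean = all(fibre_map_is_bijection(k, x) for x in points for k in range(l))

# gamma is invariant under T_i T_j^{-1} for every pair i, j.
invariant = all(
    gamma(T(i, T(j, x, -1))) == gamma(x)
    for x in points for i in range(l) for j in range(l)
)
print(clean, invariant)
```

Both checks succeed because adding 1 to one coordinate and subtracting 1 from another leaves the sum unchanged, and because translation by a fixed $y$ is a permutation of $\mathbb{Z}_N$.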

Theorem 2.4. If $\mathbf{Y} = (Y, \mathcal{C}, \nu, T_1, \ldots, T_l)$ is an ergodic dynamical system with the $T_i$
commuting, invertible, measure-preserving transformations and $f_1, \ldots, f_l \in L^\infty(\mathbf{Y})$
then there is a dynamical system $\mathbf{X} := (\prod_{i \leq l} X_i, \mathcal{B}, \mu, \tilde{T}_1, \ldots, \tilde{T}_l)$ and functions $\tilde{f}_1, \ldots, \tilde{f}_l \in L^\infty(\mathbf{X})$ such that:

• For each of the factors $(\prod_{i \neq k} X_i, \mathcal{B}_{\bar k}, \mu \upharpoonright \mathcal{B}_{\bar k}, \tilde{T}_1, \ldots, \tilde{T}_l)$ a measure disintegration
exists

• An $\mathbf{X}'$ exists which cleanly factors $\mathbf{X}$

• For each $i$ there is an $S_i$ such that $\tilde{T}_i$ has the form $\tilde{T}_i(x_1, \ldots, x_i, \ldots, x_l) = (x_1, \ldots, S_i x_i, \ldots, x_l)$

• If $A_N(\tilde{f}_1, \ldots, \tilde{f}_l)$ converges in the $L^2$ norm then $A_N(f_1, \ldots, f_l)$ does as well

Note that in the first $A_N$ above, the transformations in question are the $\tilde{T}_i$, while in
the latter, the transformations are the $T_i$. The proof depends on arguments from nonstandard
analysis and the Loeb measure construction; see, for instance, [Goldblatt, 1998]
for a reference on these topics.

Proof. If $\vec{v} \in [1, P]^l$, write $T^{\vec v}$ for $T_1^{v_1} \cdots T_l^{v_l}$. By a multidimensional version of
the pointwise ergodic theorem (for instance, the general version of the theorem for
amenable groups of transformations [Lindenstrauss, 2001]), for any function $g$ and almost
every $x$,

$$\int g \, d\nu = \lim_{P \to \infty} \frac{1}{P^l} \sum_{\vec v \in [1,P]^l} g(T^{\vec v} x)$$

A point with this property is called generic for $g$. Let $Q$ be the set of polynomial
combinations of shifts of the functions $f_i$ with rational coefficients. Since this is a
countable set, we may choose a single point $x_0$ which is generic for every element of
$Q$. For each $g \in Q$, define

$$\hat{g}(\vec n) := g(T^{\vec n} x_0)$$

Since the $f_i$ are $L^\infty$ functions, we may replace them with functions uniformly bounded
by some $M_{f_i}$, only changing them on a set of measure 0, and we may therefore assume
that each $\hat{g}$ is bounded.

Working in an $\aleph_1$-saturated nonstandard extension, choose some nonstandard $c$.
Using the Loeb measure construction, we may extend the internal counting measure on
$[1, c]^l$ to a true external measure $\mu$ on the $\sigma$-algebra generated by the internal subsets
of $[1, c]^l$. The functions $\tilde{g} := \hat{g}^* \upharpoonright [1, c]^l$, the restriction of the nonstandard extension
of $\hat{g}$, are internal, and therefore measurable, and bounded since each $\hat{g}$ is.
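For each $g \in Q$, by the definition of $\mu$,

$$\int \tilde{g} \, d\mu = st\left(\frac{1}{c^l} \sum_{\vec n \in [1,c]^l} \tilde{g}(\vec n)\right)$$

where $st$ is the standard part of a bounded nonstandard real. Furthermore

$$st\left(\frac{1}{c^l} \sum_{\vec n \in [1,c]^l} \tilde{g}(\vec n)\right) = \lim_{P \to \infty} \frac{1}{P^l} \sum_{\vec v \in [1,P]^l} g(T^{\vec v} x_0)$$

follows by transfer: for any rational $\alpha$ greater than $\lim_{P \to \infty} \frac{1}{P^l} \sum_{\vec v \in [1,P]^l} g(T^{\vec v} x_0)$
and for large enough $P$, $\alpha$ is greater than the average at $P$, so for all nonstandard $c$, $\alpha$
is greater than the average. Similarly for $\alpha$ less than the limit. Putting these together,
for any $g \in Q$,

$$\int g \, d\nu = \int \tilde{g} \, d\mu$$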



Define

$$\tilde{T}_i(x_1, \ldots, x_i, \ldots, x_l) = (x_1, \ldots, (x_i + 1) \bmod c, \ldots, x_l)$$

It follows that $\widetilde{T_i g} = \tilde{T}_i \tilde{g}$, and by ordinary properties of limits, $\tilde{\ }$ commutes with sums
and products. Therefore in particular,

$$\int [A_N(f_1, \ldots, f_l) - A_M(f_1, \ldots, f_l)]^2 \, d\nu = \int [A_N(\tilde{f}_1, \ldots, \tilde{f}_l) - A_M(\tilde{f}_1, \ldots, \tilde{f}_l)]^2 \, d\mu$$

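At each point $x_{\bar k}$ in $(\prod_{i \neq k} [1, c], \mu \upharpoonright \mathcal{B}_{\bar k})$, the Loeb measure construction induces
a measure $\mu_{k, x_{\bar k}}$ generated by setting

$$\mu_{k, x_{\bar k}}(B) := st\left(\frac{1}{c} \sum_{n \in [1,c]} \chi_B(x_{\bar k}, n)\right)$$

for internal $B$.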

Finally, let $\mathbf{X}' := ([1, c], \mathcal{B}', \mu')$ be given by the Loeb measure construction on
$[1, c]$, and let $\gamma : [1, c]^l \to [1, c]$ be $\gamma(x_1, \ldots, x_l) = \sum_{i \leq l} x_i \bmod c$. The function $\gamma$
is measurable (since it is internal), and measure-preserving (since it maps exactly $c^{l-1}$
points of $[1, c]^l$ to each point of $[1, c]$). For each $n \in [1, c]$, we may define

$$\mu_n(B) := st\left(\frac{1}{c^{l-1}} \sum_{\vec v \in [1,c]^l,\ \sum_i v_i \equiv n \bmod c} \chi_B(\vec v)\right)$$

for internal $B$ and extend this to a measure on $\mathcal{B}$ by the Loeb measure construction.

$\mathbf{X}$ is isomorphic to a product of $\mathbf{X}'$ with the Loeb measure on $[1, c]^{l-1}$. A theorem of
Keisler [Keisler, 1977][Keisler, 1984] states that when $U$ and $V$ are hyperfinite sets and
$f$ is a measurable function on $Loeb(U \times V)$, the functions $v \mapsto f(u, v)$ are measurable
for each $u$ and $\iint f \, dv \, du = \int f \, d(u \times v)$. In particular, this means that for any
measurable $B$, $\mu(B) = \int \mu_n(B) \, d\mu'(n)$, so a measure disintegration exists.
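A finite analogue of this disintegration can be verified directly. The sketch below is only an illustration under stated assumptions: a standard finite $c$ replaces the nonstandard one, so the standard-part operator disappears and all measures are normalized counting measures; the set `B` and the sizes `c = 4`, `l = 3` are arbitrary choices.

```python
from fractions import Fraction
from itertools import product

c, l = 4, 3  # illustrative finite sizes
points = list(product(range(c), repeat=l))


def gamma(x):
    return sum(x) % c


def mu(B):
    """Normalized counting measure on the product space."""
    return Fraction(sum(1 for x in points if x in B), c ** l)


def mu_n(n, B):
    """Normalized counting measure on the fibre gamma^{-1}(n)."""
    fibre = [x for x in points if gamma(x) == n]
    # Each fibre has exactly c^(l-1) points, so gamma is measure-preserving.
    assert len(fibre) == c ** (l - 1)
    return Fraction(sum(1 for x in fibre if x in B), c ** (l - 1))


# An arbitrary set that is not a product of coordinate sets.
B = {x for x in points if x[0] <= x[1] and x[2] != 2}

lhs = mu(B)
rhs = sum(mu_n(n, B) for n in range(c)) / c  # integral of mu_n(B) against mu'
print(lhs == rhs)
```

Exact rational arithmetic is used so that the identity $\mu(B) = \int \mu_n(B)\, d\mu'(n)$ is compared without rounding.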

For any $k \leq l$ and any $x_1, \ldots, x_{k-1}, x_{k+1}, \ldots, x_l \in \prod_{i \neq k} [1, c]$, $\gamma_{x_{\bar k}}$ is a measure-preserving
bijection from $[1, c]$ to itself mapping measurable sets to measurable sets,
and therefore an isomorphism. □

Using the ergodic decomposition, we may reduce the main theorem to the case
where $\mathbf{X}$ is ergodic, and then use Theorem 2.4 to reduce to the following case:

Theorem 2.5. Let $\mathbf{X} = (\prod_{i \leq l} X_i, \mathcal{B}, \mu, T_1, \ldots, T_l)$ be a cleanly factored dynamical
system such that each $T_i$ has the form

$$T_i(x_1, \ldots, x_i, \ldots, x_l) = (x_1, \ldots, S_i x_i, \ldots, x_l)$$

Then for any $f_1, \ldots, f_l$ in $L^\infty(\mathbf{X})$, $A_N(f_1, \ldots, f_l)$ converges in the $L^2$ norm.

In order to prove this theorem, we need a slightly stronger inductive hypothesis, 
which is what we will actually prove. 

Lemma 2.6. Let $\mathbf{Y}$ be an arbitrary measure space, and let $\mathbf{X} = (\prod_{i \leq l} X_i, \mathcal{B}, \mu, T_1, \ldots, T_l)$
be a cleanly factored dynamical system such that each $T_i$ has the form

$$T_i(x_1, \ldots, x_i, \ldots, x_l) = (x_1, \ldots, S_i x_i, \ldots, x_l)$$

Then for any $f_1, \ldots, f_l$ in $L^\infty(\mathbf{X} \times \mathbf{Y})$, $A_N(f_1, \ldots, f_l)$ converges in the $L^2$ norm.

For the remainder of the paper, assume $\mathbf{X}$ has this form and that $\mathbf{X}'$ is the factor
witnessing that $\mathbf{X}$ is cleanly factored, and let $\gamma$ be the projection onto this factor. By
restricting to the factor generated by the countably many translations of the functions
$f_i$, we may assume $\mathbf{X}$ and $\mathbf{X}'$ are separable.

3 Diagonal Averages 

Note that the projection $\gamma$ we have constructed is consistent with the transformations
$T_i$, in the sense that $\gamma(x) = \gamma(y)$ implies $\gamma(T_i x) = \gamma(T_i y)$. Furthermore, since $\gamma$ is
$T_i T_j^{-1}$-invariant, $\gamma(x) = \gamma(y)$ implies that $\gamma(T_i x) = \gamma(T_j y)$, even if $i \neq j$.

Definition 3.1. Define $T_{l+1}$ on $X'$ such that for each $x' \in X'$, if $\gamma(x) = x'$ then
$\gamma(T_i x) = T_{l+1} x'$.

With the particular construction we have given, this definition makes sense point- 
wise. For arbitrary X' cleanly factoring arbitrary X, this is true only almost everywhere. 

We wish to reduce Lemma 2.6 to the case where $\mathbf{X}$ is ergodic. In order to apply the
usual theorem for the existence of an ergodic decomposition (see [Glasner, 2003]), the
measure space must be a standard Borel space. It will be easier to take advantage of the
fact that we are working with the $L^2$ norm, and get a weaker ergodic decomposition
that suffices for our purposes. Let $\mathcal{C}$ be the factor consisting of sets which are $T_i$-invariant
for each $i$ and fix representations of $E(f \mid \mathcal{C})$ for each $f \in L^2(\mathbf{X})$. Let $\nu$ be
the restriction of $\mu$ to $\mathcal{C}$. For each point $x \in \prod X_i$, we can define a measure $\mu_x$ by
$\int f \, d\mu_x = E(f \mid \mathcal{C})(x)$ with the property that $\iint f \, d\mu_x \, d\nu(x) = \int f \, d\mu$. Furthermore,
the map $x \mapsto \mu_x$ is $T_i$-invariant for each $i$, since $\mathcal{C}$ is, and $\mu_x$ is ergodic for almost every
$x$. (This argument is the first step of the ordinary proof of the ergodic decomposition,
as given in [Glasner, 2003], Theorem 3.42.)

We may carry out the same construction on $\mathbf{X}'$ and observe that this preserves the
clean factoring property, so it suffices to prove Lemma 2.6 in the case where $\mu$ is
ergodic.

We wish to extend $\mathbf{X} \times \mathbf{X}'$ to ensure that for each $L^2$ function $f$ on $\mathbf{X}$, the functions
$x_{\bar k}, x' \mapsto f(x_{\bar k}, \gamma_{x_{\bar k}}^{-1}(x'))$ are measurable with integral $\int f \, d\mu$. If $\mathbf{X}$ were simply a
product $\prod_{i \leq l} \mathbf{X}_i$, this would occur automatically. Since this is not necessarily the case,
however, we must copy over all the additional sets by swapping coordinates.

Formally¹, take the measure algebra $\mathcal{B}^*$ to make measurable all functions $g(x, x')$
such that $x \mapsto \int g(x, x') \, d\mu'(x')$ is, and define

$$\int g(x, x') \, d\nu = \iint g(x, x') \, d\mu'(x') \, d\mu$$

Define $\mathbf{X}^* := (X \times X', \mathcal{B}^*, \nu)$.

Importantly, this retains a measure disintegration onto each coordinate:

$$\int f(x, x', y) \, d\nu = \iint f(x, x', y) \, d\nu_{k, (x,x')_{\bar k}} \, d\nu_k$$

where $\nu_{k, (x,x')_{\bar k}}$ is the pushforward of $\mu_{k, x_{\bar k}}$ under $\gamma$ when $k < l + 1$ and $\nu_{l+1, x}(\cdot)$ is
$\int \cdot \, \delta_x \, d\mu$.

Definition 3.2. By abuse of notation, we take $T_i$, $i \leq l + 1$, to be transformations
on $\mathbf{X}^* \times \mathbf{Y}$ where $T_i(x, x', y)$ is given by $(T_i x, x', y)$ if $i \leq l$ and $T_{l+1}(x, x', y) :=
(x, T_{l+1} x', y)$.

Since we will only refer to the product measures with $\mathbf{Y}$, and to limit the proliferation
of measures, henceforth we let $\nu$ denote the measure on $\mathbf{X}^* \times \mathbf{Y}$ and $\mu$ denote the
measure on $\mathbf{X} \times \mathbf{Y}$. We write $\mu_k$ and $\nu_k$ for the restriction of $\mu$ and $\nu$ to the $\sigma$-algebra
of $T_k$-invariant sets.

To briefly summarize the construction up to this point, given a measure space
$\mathbf{Z}$ and $L^\infty$ functions $f_1, \ldots, f_l$, we have constructed a space $\mathbf{X}^* = (\prod_{i \leq l} X_i \times
X', \mathcal{B}^*, \nu, T_1, \ldots, T_{l+1})$ with functions $\tilde{f}_1, \ldots, \tilde{f}_l$ such that:

• Convergence of $A_N(\tilde{f}_1, \ldots, \tilde{f}_l)$ implies convergence of $A_N(f_1, \ldots, f_l)$

• The transformations $T_i$ each act only on the i-th coordinate

• The space $\mathbf{X}^*$ has a measure disintegration onto each coordinate, and onto the
space of $T_i$-invariant functions for each $i$

• There is a function $\gamma : \prod_{i \leq l} X_i \to X'$ cleanly factoring the space $(\prod_{i \leq l} X_i, \mathcal{B}, \mu, T_1, \ldots, T_l)$

¹ John Griesmer suggested this simplified definition of $\mathcal{B}^*$.
Definition 3.3. Let $e \subseteq [1, l + 1]$. We say $f \in L^2(\mathbf{X}^* \times \mathbf{Y})$ is e-measurable if it is
$T_i$-invariant for each $i \notin e$. We define $I_d := \{e \subseteq [1, l + 1] \mid |e| = d\}$. We say $f$
has complexity $d$ if it is a finite sum of functions of the form $\prod_{e \in I_d} g_e$ where each $g_e$ is
e-measurable.

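Lemma 3.4. If $f \in L^2(\mathbf{X}^* \times \mathbf{Y})$ is e-measurable for some $e$ with $|e| < l + 1$ then
$f(x, \gamma(x), y)$ is an $L^2$ function and $\|f(x, \gamma(x), y)\|_{L^2(\mathbf{X} \times \mathbf{Y})} = \|f\|_{L^2(\mathbf{X}^* \times \mathbf{Y})}$.

Proof. For any $i \notin e$,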



since $\gamma_{x_{\bar i}}$ is an isomorphism. Since $f$ is $T_i$-invariant and $\mathbf{X}_i$ is ergodic with respect to
$T_i$, this is equal to



Recall that $x_{\bar i}, x_i$ is identical to the vector $x$. But the measure $\nu$ was constructed so this
is precisely



□ 

In particular, this means that $f(x, \gamma(x), y)$ is an $L^2$ function when $f \in L^2(\mathbf{X}^* \times \mathbf{Y})$
has complexity $d$ for some $d < l + 1$.

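Definition 3.5. If $f \in L^\infty(\mathbf{X}^* \times \mathbf{Y})$ has complexity $d$, define $\bar{A}_N f \in L^\infty(\mathbf{X} \times \mathbf{Y})$ by

$$\bar{A}_N f(x, y) := \frac{1}{N} \sum_{n=1}^{N} f(x, T_{l+1}^n \gamma(x), y)$$

We can reduce the question of the convergence of $A_N$ to the convergence of $\bar{A}_N$:

Definition 3.6. If $f \in L^2(\mathbf{X} \times \mathbf{Y})$, define $\tilde{f}^i(x, x', y) := f(x_{\bar i}, \gamma_{x_{\bar i}}^{-1}(x'), y)$.
Note that $\tilde{f}^i(x, T_{l+1} x', y) = f(x_{\bar i}, \gamma_{x_{\bar i}}^{-1}(T_{l+1} x'), y) = f(x_{\bar i}, T_i \gamma_{x_{\bar i}}^{-1}(x'), y)$.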


Lemma 3.7. Let $f_1, \ldots, f_l$ be given. $A_N(f_1, \ldots, f_l)$ converges in the $L^2$ norm iff
$\bar{A}_N \prod_{i \in \{1, \ldots, l\}} \tilde{f}_i^i$ converges in the $L^2$ norm.
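Proof.

$$\bar{A}_N \prod_i \tilde{f}_i^i(x, y) = \frac{1}{N} \sum_{n=1}^{N} \prod_i \tilde{f}_i^i(x, T_{l+1}^n \gamma(x), y)$$

$$= \frac{1}{N} \sum_{n=1}^{N} \prod_i f_i(x_{\bar i}, \gamma_{x_{\bar i}}^{-1}(T_{l+1}^n \gamma(x)), y)$$

$$= \frac{1}{N} \sum_{n=1}^{N} \prod_i f_i(x_{\bar i}, T_i^n \gamma_{x_{\bar i}}^{-1}(\gamma(x)), y)$$

$$= \frac{1}{N} \sum_{n=1}^{N} \prod_i f_i(x_{\bar i}, T_i^n x_i, y)$$

$$= \frac{1}{N} \sum_{n=1}^{N} \prod_i f_i(T_i^n x, y)$$

$$= A_N(f_1, \ldots, f_l)(x, y)$$

□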
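The identity in this proof can be tested numerically in the finite model. This is a sketch under assumptions not made in the text: the space is $\mathbb{Z}_c^l$ with $\gamma(x) = \sum_i x_i \bmod c$, each $T_i$ adds 1 to the i-th coordinate, $T_{l+1}$ adds 1 on $\mathbb{Z}_c$, the space $\mathbf{Y}$ is omitted, and the test functions in `fs` are arbitrary. The helper `lift` is a hypothetical name for the map sending $f$ to the function $(x, x') \mapsto f(x_{\bar i}, \gamma_{x_{\bar i}}^{-1}(x'))$.

```python
from itertools import product
from math import prod

c, l, N = 5, 3, 7  # illustrative sizes
points = list(product(range(c), repeat=l))


def gamma(x):
    return sum(x) % c


def T(i, x, n):
    """T_i^n on the product space: add n to the i-th coordinate mod c."""
    return x[:i] + ((x[i] + n) % c,) + x[i + 1:]


def lift(f, i):
    """Return (x, x') -> f(x_{bar i}, z), where z is the unique value with
    gamma(x_{bar i}, z) = x'."""
    def lifted(x, xp):
        others = sum(x) - x[i]
        return f(x[:i] + ((xp - others) % c,) + x[i + 1:])
    return lifted


# Arbitrary integer-valued test functions f_1, ..., f_l.
fs = [
    (lambda x, a=a: (a + 1) * x[0] + x[1] * x[2] - a * x[(a + 1) % l])
    for a in range(l)
]


def diagonal_average(x):
    """A_N(f_1, ..., f_l)(x) = (1/N) sum_n prod_i f_i(T_i^n x)."""
    return sum(
        prod(f(T(i, x, n)) for i, f in enumerate(fs)) for n in range(1, N + 1)
    ) / N


def lifted_average(x):
    """(1/N) sum_n prod_i lift(f_i, i)(x, T_{l+1}^n gamma(x))."""
    lifted = [lift(f, i) for i, f in enumerate(fs)]
    return sum(
        prod(g(x, (gamma(x) + n) % c) for g in lifted) for n in range(1, N + 1)
    ) / N


agree = all(diagonal_average(x) == lifted_average(x) for x in points)
print(agree)
```

The two averages agree term by term because solving $\gamma(x_{\bar i}, z) = \gamma(x) + n$ for $z$ gives $z = x_i + n$, which is exactly the i-th coordinate of $T_i^n x$.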

Each $\tilde{f}_i^i$ is $[1, l + 1] \setminus \{i\}$-measurable, so to prove the main theorem, it suffices to
prove convergence of $\bar{A}_N g$ for functions of complexity $d < l + 1$.

While $\bar{A}_N f$ was defined as a function in $L^\infty(\mathbf{X} \times \mathbf{Y})$, we will sometimes view it
as the function in $L^\infty(\mathbf{X}^* \times \mathbf{Y})$ where $x'$ is a dummy variable.

Lemma 3.8. If $g$ and $f$ are $L^\infty(\mathbf{X}^* \times \mathbf{Y})$ functions with complexity $d < l + 1$ and $g$ is
$T_{l+1}$-invariant then $\bar{A}_N g f = g \bar{A}_N f$.

Proof. Immediate from the definition. □

Lemma 3.9. Suppose $g$ has complexity 1. Then $\bar{A}_N g$ converges in the $L^2$ norm.

Proof. If for almost every $y \in Y$, we have convergence for $x \mapsto g(x, y)$ then we may
apply the dominated convergence theorem to obtain convergence over $\mathbf{X}^* \times \mathbf{Y}$. Since
$\bar{A}_N$ distributes over sums, we may further assume that $g$ has the form $\prod_i g_i$ where each
$g_i$ is $\{i\}$-measurable. Then $\bar{A}_N g = \prod_{i \neq l+1} g_i \bar{A}_N g_{l+1}$, and by the previous lemma, it
suffices to show that $\bar{A}_N g_{l+1}$ converges. But this follows immediately from the mean
ergodic theorem. □
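The mean ergodic theorem invoked here can be seen concretely for a rotation of a finite cyclic group. The sketch below assumes the shift $x \mapsto x + 1$ on $\mathbb{Z}_c$ with normalized counting measure, for which the only invariant functions are constants; the values in `g` are arbitrary. It splits $g$ into its mean and a remainder whose averages shrink in norm.

```python
from math import sqrt

c = 6
g = [3.0, -1.0, 4.0, 1.0, -5.0, 9.0]   # arbitrary function on Z_c
mean = sum(g) / c
g_invariant = [mean] * c               # invariant component
g_random = [v - mean for v in g]       # component whose averages vanish


def average(h, N):
    """(1/N) sum_{n=1}^{N} h(x + n mod c), as a function of x."""
    return [sum(h[(x + n) % c] for n in range(1, N + 1)) / N for x in range(c)]


def l2_norm(h):
    return sqrt(sum(v * v for v in h) / c)


norms = [l2_norm(average(g_random, N)) for N in (1, 10, 100, 1000)]
print(norms)                # decreasing towards 0
print(average(g, 1000))     # every entry close to the mean of g
```

The averages of the invariant component are constant in $N$, so convergence of the averages of $g$ reduces to the norms of the second component tending to 0.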

Because the inductive step generalizes the proof of the ordinary mean ergodic theorem,
it is instructive to consider the form of that proof. The key step is proving that the
function $g_{l+1}$ can be partitioned into two components; these components are usually
described as an invariant component $g_\perp$ and a component $g_\top$ in the limit of functions
of the form $u - T_{l+1} u$. Unfortunately, this characterization of the second set does not
generalize. There is an alternative characterization, namely that $g_\top$ has the property
that $\|\bar{A}_N g_\top\|$ converges to 0. This turns out to be harder to work with (and, in particular,
this characterization does not seem to give a pointwise version of the theorem), but
it can be extended to higher complexity versions.

We will argue as follows: take a function of complexity $d$ in the form $\prod g_e$ with
each $g_e$ e-measurable, and argue that each $g_e$ can be written in the form $g_{e,\perp} + g_{e,\top}$,
where $g_{e,\top}$ is suitably random, so that $\|\bar{A}_N g_{e,\top} \prod_{e' \neq e} g_{e'}\| \to 0$, while $g_{e,\perp}$ is essentially
of complexity $d - 1$. If we observe that constant functions have complexity 0, the usual
proof of the mean ergodic theorem has the same form.



4 From Averages to Integrals 

We need a way to pass from discrete limits to an integral in order to apply the inductive 
hypothesis. 

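Lemma 4.1. Let $\mathbf{X} = (X, \mathcal{B}, \mu)$ be a separable measure space and let $b$ be a real
number. For $s \leq k$, let $\mathbf{X}_s$ be a factor of $\mathbf{X}$ and $\{b_{m,m',s}\}_{m \leq m' \in \mathbb{N}}$ be a sequence of
$L^\infty(\mathbf{X}_s)$ functions bounded (in the $L^\infty$ norm) by $b$. Let $\{m_t\}_{t \in \mathbb{N}}$ be a sequence such
that

$$\frac{1}{m_t} \sum_{i=1}^{m_t} \prod_{s \leq k} b_{i, m_t, s}$$

converges weakly to $f$. Then there is a space $\mathbf{Y} = (Y, \mathcal{D}, \sigma)$ and functions $b_s \in
L^\infty(\mathbf{X}_s \times \mathbf{Y})$ such that $f(x) = \int \prod_{s \leq k} b_s(x, y) \, d\sigma$ for almost every $x$.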

Proof. Consider an $\aleph_1$-saturated nonstandard extension of a universe containing $\mathbf{X}$ and
the sequences $\{m_t\}$ and $\{b_{m,m_t,s}\}$. Then for each $s \leq k$, there is a nonstandard extension
of the sequence $\{b_{m,m',s}\}_{m \leq m' \in \mathbb{N}}$, which we denote $b^*_{m,m',s}$. Let $M := m_{t'}$ for
some nonstandard $t'$, and let $Y := [1, M]$. $Y$ is a hyperfinitely additive measure space
(taking the counting measure on $Y$), and so, by the Loeb measure construction, there is
an external $\sigma$-additive measure, $\sigma$, extending it; we denote the resulting measure space
$Loeb(Y)$.

The elements $b^*_{n,M,s}$ are in $(L^\infty(\mathbf{X}_s))^*$. Fix some orthonormal basis $\{f_i\}_{i \in \mathbb{N}}$ for
$L^2(\mathbf{X})$ consisting of $L^\infty$ functions. For convenience, we may assume that for each
$s$ and each $i$, either $f_i \in L^2(\mathbf{X}_s)$ or $f_i$ is orthogonal to every element of $L^2(\mathbf{X}_s)$.
Then the nonstandard version of this sequence, $\{f_i\}^*$, is a sequence indexed by $i \in \mathbb{N}^*$
which is an orthonormal basis of $L^2(\mathbf{X}^*)$, and further, for $i \in \mathbb{N}$, the i-th element
is just $f_i^*$. Therefore the notation $f_i^*$ for $i \in \mathbb{N}^*$ for elements of this sequence is
unambiguous. Define $b_s^*(x, y) = b^*_{y,M,s}(x)$. Then $b_s^* = \sum_{i \in \mathbb{N}^*} a_{i,s}(y) f_i^*$ for some
$a_{i,s}(y)$. Furthermore, if $\int b_{y,m_t,s} f_i \, d\mu = 0$ for each $y, m_t$ then $a_{i,s} = 0$, and this is
the case whenever $f_i$ is orthogonal to every element of $L^2(\mathbf{X}_s)$. Since the function
$y \mapsto b^*_{y,M,s}$ is internal, so is each $a_{i,s}$, so $st \circ a_{i,s}$ is measurable with respect to
$Loeb(Y)$. Furthermore, since $f_i^*$ and $f_j^*$ are orthogonal when $i \neq j$,

$$\sum_{i \in \mathbb{N}^*} |a_{i,s}(y)|^2 \leq b^2$$

Consider the infinite sum $\sum_{i \in \mathbb{N}} (st \circ a_{i,s}(y)) f_i(x)$. The norm of each finite initial
segment of this sum must also be bounded by $b^2$, and so the infinite sum is convergent.
Therefore we may define $b_s(x, y) := \sum_{i \in \mathbb{N}} (st \circ a_{i,s}(y)) f_i(x)$. Since $b_s$ is a bounded
sum of elements of $L^\infty(\mathbf{X} \times Loeb(Y))$, $b_s \in L^\infty(\mathbf{X} \times Loeb(Y))$. Furthermore, since
$a_{i,s} = 0$ whenever $f_i \perp L^2(\mathbf{X}_s)$, it follows that $b_s \in L^\infty(\mathbf{X}_s \times Loeb(Y))$.
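In addition, note that the sum over the tails, $\sum_{i > N} \frac{1}{M} \sum_{y=1}^{M} |a_{i,s}(y)|$, approaches
0, so in particular, $st\left(\sum_{i \in \mathbb{N}^* \setminus \mathbb{N}} \frac{1}{M} \sum_{y=1}^{M} |a_{i,s}(y)|\right) = 0$.
Let $g \in L^2(\mathbf{X})$. Then for any $\epsilon$ and all but finitely many $t$,

$$\frac{1}{m_t} \sum_{i=1}^{m_t} \int g \prod_{s \leq k} b_{i,m_t,s} \, d\mu + \epsilon > \liminf_{t} \frac{1}{m_t} \sum_{i=1}^{m_t} \int g \prod_{s \leq k} b_{i,m_t,s} \, d\mu$$

By transfer, it follows that

$$\frac{1}{M} \sum_{y=1}^{M} \int g^* \prod_{s \leq k} b^*_{y,M,s} \, d\mu^* + \epsilon > \liminf_{t} \frac{1}{m_t} \sum_{i=1}^{m_t} \int g \prod_{s \leq k} b_{i,m_t,s} \, d\mu$$

for each $\epsilon > 0$, and so

$$st\left(\frac{1}{M} \sum_{y=1}^{M} \int g^* \prod_{s \leq k} b^*_{y,M,s} \, d\mu^*\right) \geq \liminf_{t} \frac{1}{m_t} \sum_{i=1}^{m_t} \int g \prod_{s \leq k} b_{i,m_t,s} \, d\mu$$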

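We have

$$st\left(\frac{1}{M} \sum_{y=1}^{M} \int g^* \prod_{s \leq k} b^*_{y,M,s} \, d\mu^*\right) = st\left(\frac{1}{M} \sum_{y=1}^{M} \int g^* \prod_{s \leq k} \sum_{i \in \mathbb{N}^*} a_{i,s}(y) f_i^* \, d\mu^*\right)$$

We may split the sum into the standard and nonstandard components, giving

$$st\left(\frac{1}{M} \sum_{y=1}^{M} \int g^* \prod_{s \leq k} \left(\sum_{i \in \mathbb{N}} a_{i,s}(y) f_i^* + \sum_{i \in \mathbb{N}^* \setminus \mathbb{N}} a_{i,s}(y) f_i^*\right) d\mu^*\right)$$

The product expands into a sum of $2^k$ components, which we may distribute out of the
integral and across the average and standard part operator. Then the first component is

$$st\left(\frac{1}{M} \sum_{y=1}^{M} \int g^* \prod_{s \leq k} \sum_{i \in \mathbb{N}} a_{i,s}(y) f_i^* \, d\mu^*\right)$$

while all other components have the form

$$st\left(\frac{1}{M} \sum_{y=1}^{M} \int g^* \left(\sum_{i \in \mathbb{N}^* \setminus \mathbb{N}} a_{i,s_0}(y) f_i^*\right) \prod_{s \neq s_0} \sum_{i \in D_s} a_{i,s}(y) f_i^* \, d\mu^*\right)$$

for some $s_0$, and where each $D_s$ is either $\mathbb{N}$ or $\mathbb{N}^* \setminus \mathbb{N}$. These components are bounded
by

$$st\left(\frac{1}{M} \sum_{y=1}^{M} \Big\|\sum_{i \in \mathbb{N}^* \setminus \mathbb{N}} a_{i,s_0}(y) f_i^*\Big\|_{L^2(\mathbf{X})} \cdot \Big\|g^* \prod_{s \neq s_0} \sum_{i \in D_s} a_{i,s}(y) f_i^*\Big\|_{L^2(\mathbf{X})}\right)$$

$$\leq st\left(\frac{1}{M} \sum_{y=1}^{M} \Big(\sum_{i \in \mathbb{N}^* \setminus \mathbb{N}} |a_{i,s_0}(y)| \, \|f_i^*\|_{L^2(\mathbf{X})}\Big) \cdot \Big\|g^* \prod_{s \neq s_0} \sum_{i \in D_s} a_{i,s}(y) f_i^*\Big\|_{L^2(\mathbf{X})}\right)$$

However, as shown above, $st\left(\frac{1}{M} \sum_{y=1}^{M} \sum_{i \in \mathbb{N}^* \setminus \mathbb{N}} |a_{i,s}(y)|\right) = 0$. Since $\|g^* \prod_{s \neq s_0} \sum_{i \in D_s} a_{i,s}(y) f_i^*\|_{L^2(\mathbf{X})}$
is bounded by $\|g^*\|_{L^2(\mathbf{X})} \prod_{s \neq s_0} \|\sum_{i \in D_s} a_{i,s}(y) f_i^*\|_{L^2(\mathbf{X})} < \infty$, it follows that all but
the first component must be 0. Therefore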



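$$st\left(\frac{1}{M} \sum_{y=1}^{M} \int g^* \prod_{s \leq k} b^*_{y,M,s} \, d\mu^*\right) = st\left(\frac{1}{M} \sum_{y=1}^{M} \int g^* \prod_{s \leq k} \sum_{i \in \mathbb{N}} a_{i,s}(y) f_i^* \, d\mu^*\right)$$

But by the definition of $b_s$ this is equal to

$$st\left(\frac{1}{M} \sum_{y=1}^{M} \int g \prod_{s \leq k} b_s(x, y) \, d\mu\right)$$

It follows that

$$\iint g \prod_{s \leq k} b_s \, d\mu \, d\sigma = st\left(\frac{1}{M} \sum_{y=1}^{M} \int g \prod_{s \leq k} b_s(x, y) \, d\mu\right) \geq \liminf_{t} \frac{1}{m_t} \sum_{i=1}^{m_t} \int g \prod_{s \leq k} b_{i,m_t,s} \, d\mu$$

A similar argument applies to the lim sup, so we conclude that $\iint g(x) \prod_{s \leq k} b_s(x, y) \, d\mu \, d\sigma
= \int g(x) f(x) \, d\mu$ for any $g$.

□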



5 The Inductive Step 

We now return to the proof of Lemma 2.6. Let $\mathbf{X} = (\prod_{i \leq l} X_i, \mathcal{B}, \mu, T_1, \ldots, T_l)$
cleanly factored by $\mathbf{X}'$ be given, and let $\mathbf{Y}$ be an arbitrary measure space. Recall that
$I_n$ is the set of subsets of $[1, l + 1]$ with cardinality $n$. If $e$ is a subset of $[1, l + 1]$, we
write $\bar{e}$ for the complement of $e$, that is, $[1, l + 1] \setminus e$. We continue to be concerned with
functions belonging to $L^\infty(\mathbf{X}^*)$.

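Definition 5.1. Let $e_0 \subseteq [1, l + 1]$ contain $l + 1$. $Z_{e_0}$ is the subspace of the $e_0$-measurable
functions $g$ such that for every sequence $\langle g_e \rangle_{e \in I_{|e_0|} \setminus \{e_0\}}$ with each $g_e$ e-measurable,

$$\Big\|\bar{A}_N g \prod_e g_e\Big\| \to 0$$

as $N$ goes to $\infty$.

$D_{e_0}$ is the set of $e_0$-measurable functions generated by projections onto the $e_0$-measurable
sets of weak limit points of sequences of the form

$$\frac{1}{N} \sum_{n=1}^{N} \prod_{i \in e_0} b_{i,N}(x_{\bar k}, T_k^{-n} \gamma_{x_{\bar k}}^{-1}(x'), x', y)$$

as $N$ goes to infinity, for some $k \notin e_0$, where each $b_{i,N}$ is $[1, l + 1] \setminus \{i\}$-measurable.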

Lemma 5.2. If $g$ is $e_0$-measurable where $l + 1 \in e_0$, $|e_0| < d + 1$, and $g \notin Z_{e_0}$ then
there is an $h \in D_{e_0}$ such that $\int g h \, d\mu > 0$.
Proof. Let an $e_0$-measurable $g \notin Z_{e_0}$ be given. Then there is a sequence $\langle g_e \rangle_{e \in I_{|e_0|} \setminus \{e_0\}}$
where each $g_e$ is e-measurable and some $\epsilon > 0$ such that $\|\bar{A}_N(g \prod_{e \in I_{|e_0|}, e \neq e_0} g_e)\| >
\epsilon$ for infinitely many $N$. Set $f_N := \bar{A}_N(g \prod_{e \in I_{|e_0|}, e \neq e_0} g_e)$. For each such $N$, we
have

$$\int f_N \, \bar{A}_N\Big(g \prod_e g_e\Big) \, d\mu > \epsilon^2$$

This means

$$\int \frac{1}{N} \sum_{n=1}^{N} f_N(x, y) \, g(x, T_{l+1}^n \gamma(x), y) \prod_{e \in I_{|e_0|}, e \neq e_0} g_e(x, T_{l+1}^n \gamma(x), y) \, d\mu > \epsilon^2$$

For each $e \neq e_0$, there is some $i \in e_0 \setminus e$, so we may assign to each $g_e$ some $i$ such
that $g_e$ is independent of $x_i$ and collect the $g_e$ into terms $b_{i,N}$ (independent of $N$, in
fact), each a product of some of the $g_e$, such that $b_{i,N}$ is independent of $x_i$. Since $f_N$ is
$[1, l]$-measurable, we may also fold $f_N$ into $b_{l+1,N}$, and we have therefore shown that
there exist functions $b_{i,N}$ which are $[1, l + 1] \setminus \{i\}$-measurable such that

$$\int \frac{1}{N} \sum_{n=1}^{N} g(x, T_{l+1}^n \gamma(x), y) \prod_{i \in e_0} b_{i,N}(x, T_{l+1}^n \gamma(x), y) \, d\mu > \epsilon^2$$

Choosing some $k \notin e_0$, and letting $g'(x_{\bar k}, x', y) := g(x, x', y)$ for almost any $x_k$, this
becomes

$$\int g'(x_{\bar k}, x', y) \frac{1}{N} \sum_{n=1}^{N} \prod_{i \in e_0} b_{i,N}(x_{\bar k}, T_k^{-n} \gamma_{x_{\bar k}}^{-1}(x'), x', y) \, d\nu > \epsilon^2$$

for infinitely many $N$. Choosing a subsequence $S$ of these $N$ such that

$$\frac{1}{N} \sum_{n=1}^{N} \prod_{i \in e_0} b_{i,N}(x_{\bar k}, T_k^{-n} \gamma_{x_{\bar k}}^{-1}(x'), x', y)$$

converges weakly to some $h'$, the projection $h$ of $h'$ onto the $e_0$-measurable sets witnesses the lemma. (In
particular, since $g$ is $e_0$-measurable, $\int g h \, d\mu = \int g h' \, d\mu > 0$.)

□

Lemma 5.3. Every $e_0$-measurable function $g$ may be written in the form $g_\perp + g_\top$
where $g_\perp \in D_{e_0}$ and $g_\top \in Z_{e_0}$.

Proof. Consider the projection of $g$ onto $D_{e_0}$. By the previous lemma, if $g - E(g \mid
D_{e_0})$ is not in $Z_{e_0}$ then there is an $h \in D_{e_0}$ such that $\int h (g - E(g \mid D_{e_0})) \, d\mu > 0$; this
is a contradiction, so $g - E(g \mid D_{e_0})$ belongs to $Z_{e_0}$. □

We could proceed to show that this decomposition is unique, but this is not neces- 
sary for the proof. 
Lemma 5.4. If $g = \prod_{e \in I_{d+1}} g_e$ and each $g_e \in D_e$ then $\bar{A}_N g$ converges in the $L^2$
norm.

Proof. For convenience, assume $g$ is in the stricter form $\prod_{e \in I_{d+1}, l+1 \in e} g_e$. This is
without loss of generality, since if $h = \prod_{e \in I_{d+1}, l+1 \notin e} g_e$ then we have

$$\bar{A}_N h \prod_{e \in I_{d+1}, l+1 \in e} g_e = h \bar{A}_N \prod_{e \in I_{d+1}, l+1 \in e} g_e$$

First, assume each $g_e$ is a basic element of $D_e$; that is, there is a function $g'_e$ such
that $g_e$ is the projection of $g'_e$ onto $\mathcal{B}_e$ and $g'_e$ is a weak limit of an average of the form

$$\frac{1}{N} \sum_{n=1}^{N} \prod_i b^e_{i,N}(x_{\bar k}, T_k^{-n} \gamma_{x_{\bar k}}^{-1}(x'), x', y)$$

Define $b^e_{n,N,i} := b^e_{i,N}(x_{\bar k}, T_k^{-n} \gamma_{x_{\bar k}}^{-1}(x'), x', y)$. Then Lemma 4.1 applies, so there exist
functions $\tilde{b}^e_i$ such that

$$g'_e(x_{\bar k}, x', y) = \int \prod_i \tilde{b}^e_i(x_{\bar k}, z, x', y) \, d\sigma$$

Since each $g_e$ is the e-measurable projection of this function, we may fold $x_{\bar e}$ into $z$,
integrating over a larger measure space to give

$$g_e(x_e, x', y) = \int \prod_i \tilde{b}^e_i(x_e, z', x', y) \, d\sigma'$$

Since each $g_e$ has this form, and these $\tilde{b}^e_i$ are $e \setminus \{i\}$-measurable, it follows that $g$ has
complexity $d - 1$, so the result follows by the inductive hypothesis.

If the $g_e$ are sums of basic elements of $D_e$, the result follows immediately. If $g_e$ is a
limit of such elements, each $g_e$ can be written $g_e^0 + g_e^1$ where $g_e^0$ is a finite sum of basic
elements of $D_e$ and the norm of $g_e^1$ is small. Then $\prod g_e = \sum_{E \subseteq I_{d+1}} \prod_{e \in E} g_e^0 \prod_{e \notin E} g_e^1$.
When $E = I_{d+1}$, the result follows from the result for finite sums. When $E \neq I_{d+1}$,
the product contains some $g_e^1$, and since $g_e^1$ is e-measurable, it follows that $\|\bar{A}_N g_e^1\| \leq
\|g_e^1\|$. Since the $g_{e'}$ are bounded in the $L^\infty$ norm, $\|\bar{A}_N \prod_e g_e\| \leq b \|g_e^1\|$ for some
constant $b$, so $\bar{A}_N \prod_{e \in E} g_e^0 \prod_{e \notin E} g_e^1$ has small norm if $E \neq I_{d+1}$. □

Using this, it is possible to prove Lemma 2.6. If $g = \prod_{e \in I_{d+1}} g_e(x, x', y)$ where
each $g_e$ is e-measurable then it suffices to show convergence at each $y$, since then
the dominated convergence theorem implies convergence over the whole space. When
$l + 1 \notin e$, we have $\bar{A}_N g_e f = g_e \bar{A}_N f$, so it suffices to show that $\bar{A}_N g$ converges where
$g$ has the form

$$\prod_{e \in I_{d+1}, l+1 \in e} g_e$$

Then write each $g_e$ as $g_{e,\perp} + g_{e,\top}$. Expanding the product gives

$$\sum_{E \subseteq \{e \in I_{d+1} \mid l+1 \in e\}} \prod_{e \notin E} g_{e,\perp} \prod_{e \in E} g_{e,\top}$$

where each $g_{e,\top}$ is in $Z_e$ and each $g_{e,\perp}$ is in $D_e$. Since $\bar{A}_N$ distributes over sums, it suffices
to show that each summand converges. When $E$ is non-empty, $\bar{A}_N \prod_{e \notin E} g_{e,\perp} \prod_{e \in E} g_{e,\top}$
converges to the 0 function by the definition of $Z_e$. When $E$ is empty, Lemma 5.4 applies.